Overview
Brought to you by YData
Dataset statistics
| Training Data | Test Data | |
|---|---|---|
| Number of variables | 12 | 11 |
| Number of observations | 593994 | 254569 |
| Missing cells | 0 | 0 |
| Missing cells (%) | 0.0% | 0.0% |
| Duplicate rows | 0 | 0 |
| Duplicate rows (%) | 0.0% | 0.0% |
| Total size in memory | 54.4 MiB | 21.4 MiB |
| Average record size in memory | 96.0 B | 88.0 B |
Variable types
| Training Data | Test Data | |
|---|---|---|
| Numeric | 5 | 5 |
| Categorical | 7 | 6 |
| Training Data | Test Data | |
|---|---|---|
credit_score is highly overall correlated with grade_subgrade and 1 other fields | credit_score is highly overall correlated with grade_subgrade and 1 other fields | High correlation |
employment_status is highly overall correlated with loan_paid_back | Alert not present in this dataset | High correlation |
grade_subgrade is highly overall correlated with credit_score | grade_subgrade is highly overall correlated with credit_score | High correlation |
interest_rate is highly overall correlated with credit_score | interest_rate is highly overall correlated with credit_score | High correlation |
loan_paid_back is highly overall correlated with employment_status | Alert not present in this dataset | High correlation |
Reproduction
| Training Data | Test Data | |
|---|---|---|
| Analysis started | 2025-11-14 17:00:20.786996 | 2025-11-14 17:00:38.390723 |
| Analysis finished | 2025-11-14 17:00:38.380925 | 2025-11-14 17:00:45.493136 |
| Duration | 17.59 seconds | 7.1 seconds |
| Software version | ydata-profiling vv4.17.0 | ydata-profiling vv4.17.0 |
| Download configuration | config.json | config.json |
Variables
annual_income
Real number (ℝ)
| Training Data | Test Data | |
|---|---|---|
| Distinct | 119728 | 67287 |
| Distinct (%) | 20.2% | 26.4% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 48212.203 | 48233.08 |
| Training Data | Test Data | |
|---|---|---|
| Minimum | 6002.43 | 6011.77 |
| Maximum | 393381.74 | 380653.94 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
Quantile statistics
| Training Data | Test Data | |
|---|---|---|
| Minimum | 6002.43 | 6011.77 |
| 5-th percentile | 15450.11 | 15447.27 |
| Q1 | 27934.4 | 27950.3 |
| median | 46557.68 | 46528.98 |
| Q3 | 60981.32 | 61149.44 |
| 95-th percentile | 93534.68 | 93534.48 |
| Maximum | 393381.74 | 380653.94 |
| Range | 387379.31 | 374642.17 |
| Interquartile range (IQR) | 33046.92 | 33199.14 |
Descriptive statistics
| Training Data | Test Data | |
|---|---|---|
| Standard deviation | 26711.942 | 26719.659 |
| Coefficient of variation (CV) | 0.5540494 | 0.55396957 |
| Kurtosis | 7.0914126 | 7.1975949 |
| Mean | 48212.203 | 48233.08 |
| Median Absolute Deviation (MAD) | 17068.9 | 17123.71 |
| Skewness | 1.7195087 | 1.7210527 |
| Sum | 2.8637759 × 1010 | 1.2278647 × 1010 |
| Variance | 7.1352785 × 108 | 7.1394015 × 108 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 51351.71 | 238 | < 0.1% |
| 25499.88 | 227 | < 0.1% |
| 24113.12 | 219 | < 0.1% |
| 56547.75 | 209 | < 0.1% |
| 26386.33 | 187 | < 0.1% |
| 28991.07 | 185 | < 0.1% |
| 16077.08 | 170 | < 0.1% |
| 46949.29 | 160 | < 0.1% |
| 53981.9 | 152 | < 0.1% |
| 52628.69 | 146 | < 0.1% |
| Other values (119718) | 592101 |
| Value | Count | Frequency (%) |
| 25499.88 | 92 | < 0.1% |
| 51351.71 | 86 | < 0.1% |
| 24113.12 | 86 | < 0.1% |
| 56547.75 | 84 | < 0.1% |
| 28991.07 | 78 | < 0.1% |
| 16077.08 | 75 | < 0.1% |
| 26386.33 | 70 | < 0.1% |
| 52628.69 | 64 | < 0.1% |
| 61519.27 | 64 | < 0.1% |
| 51773.69 | 64 | < 0.1% |
| Other values (67277) | 253806 |
| Value | Count | Frequency (%) |
| 6002.43 | 1 | < 0.1% |
| 6008.56 | 1 | < 0.1% |
| 6026.31 | 3 | |
| 6026.47 | 1 | < 0.1% |
| 6026.71 | 1 | < 0.1% |
| 6064.78 | 1 | < 0.1% |
| 6071.69 | 1 | < 0.1% |
| 6073.15 | 1 | < 0.1% |
| 6074.92 | 1 | < 0.1% |
| 6093.55 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6011.77 | 1 | < 0.1% |
| 6018.23 | 1 | < 0.1% |
| 6018.9 | 1 | < 0.1% |
| 6026.31 | 3 | < 0.1% |
| 6073.64 | 1 | < 0.1% |
| 6100.16 | 1 | < 0.1% |
| 6100.32 | 13 | |
| 6100.33 | 1 | < 0.1% |
| 6105.99 | 2 | < 0.1% |
| 6109.87 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6011.77 | 1 | < 0.1% |
| 6018.23 | 1 | < 0.1% |
| 6018.9 | 1 | < 0.1% |
| 6026.31 | 3 | < 0.1% |
| 6073.64 | 1 | < 0.1% |
| 6100.16 | 1 | < 0.1% |
| 6100.32 | 13 | |
| 6100.33 | 1 | < 0.1% |
| 6105.99 | 2 | < 0.1% |
| 6109.87 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6002.43 | 1 | < 0.1% |
| 6008.56 | 1 | < 0.1% |
| 6026.31 | 3 | |
| 6026.47 | 1 | < 0.1% |
| 6026.71 | 1 | < 0.1% |
| 6064.78 | 1 | < 0.1% |
| 6071.69 | 1 | < 0.1% |
| 6073.15 | 1 | < 0.1% |
| 6074.92 | 1 | < 0.1% |
| 6093.55 | 1 | < 0.1% |
debt_to_income_ratio
Real number (ℝ)
| Training Data | Test Data | |
|---|---|---|
| Distinct | 526 | 506 |
| Distinct (%) | 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 0.12069589 | 0.12058304 |
| Training Data | Test Data | |
|---|---|---|
| Minimum | 0.011 | 0.011 |
| Maximum | 0.627 | 0.627 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
Quantile statistics
| Training Data | Test Data | |
|---|---|---|
| Minimum | 0.011 | 0.011 |
| 5-th percentile | 0.046 | 0.046 |
| Q1 | 0.072 | 0.072 |
| median | 0.096 | 0.096 |
| Q3 | 0.156 | 0.156 |
| 95-th percentile | 0.259 | 0.259 |
| Maximum | 0.627 | 0.627 |
| Range | 0.616 | 0.616 |
| Interquartile range (IQR) | 0.084 | 0.084 |
Descriptive statistics
| Training Data | Test Data | |
|---|---|---|
| Standard deviation | 0.068573259 | 0.06858229 |
| Coefficient of variation (CV) | 0.56814907 | 0.56875568 |
| Kurtosis | 2.33523 | 2.4084944 |
| Mean | 0.12069589 | 0.12058304 |
| Median Absolute Deviation (MAD) | 0.032 | 0.032 |
| Skewness | 1.4066799 | 1.4199971 |
| Sum | 71692.635 | 30696.704 |
| Variance | 0.0047022918 | 0.0047035305 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.09 | 11440 | 1.9% |
| 0.093 | 11160 | 1.9% |
| 0.097 | 9508 | 1.6% |
| 0.079 | 9099 | 1.5% |
| 0.094 | 8976 | 1.5% |
| 0.098 | 8647 | 1.5% |
| 0.071 | 8192 | 1.4% |
| 0.096 | 7715 | 1.3% |
| 0.063 | 7579 | 1.3% |
| 0.067 | 7373 | 1.2% |
| Other values (516) | 504305 |
| Value | Count | Frequency (%) |
| 0.093 | 4951 | 1.9% |
| 0.09 | 4947 | 1.9% |
| 0.097 | 4003 | 1.6% |
| 0.094 | 3870 | 1.5% |
| 0.079 | 3797 | 1.5% |
| 0.098 | 3784 | 1.5% |
| 0.071 | 3452 | 1.4% |
| 0.096 | 3384 | 1.3% |
| 0.063 | 3282 | 1.3% |
| 0.067 | 3144 | 1.2% |
| Other values (496) | 215955 |
| Value | Count | Frequency (%) |
| 0.011 | 169 | |
| 0.012 | 55 | < 0.1% |
| 0.013 | 127 | |
| 0.014 | 243 | |
| 0.015 | 138 | |
| 0.016 | 80 | < 0.1% |
| 0.017 | 205 | |
| 0.018 | 186 | |
| 0.019 | 61 | < 0.1% |
| 0.02 | 152 |
| Value | Count | Frequency (%) |
| 0.011 | 77 | |
| 0.012 | 20 | < 0.1% |
| 0.013 | 61 | |
| 0.014 | 96 | |
| 0.015 | 69 | |
| 0.016 | 36 | < 0.1% |
| 0.017 | 109 | |
| 0.018 | 83 | |
| 0.019 | 22 | < 0.1% |
| 0.02 | 58 |
| Value | Count | Frequency (%) |
| 0.011 | 77 | |
| 0.012 | 20 | < 0.1% |
| 0.013 | 61 | |
| 0.014 | 96 | |
| 0.015 | 69 | |
| 0.016 | 36 | < 0.1% |
| 0.017 | 109 | |
| 0.018 | 83 | |
| 0.019 | 22 | < 0.1% |
| 0.02 | 58 |
| Value | Count | Frequency (%) |
| 0.011 | 169 | |
| 0.012 | 55 | < 0.1% |
| 0.013 | 127 | |
| 0.014 | 243 | |
| 0.015 | 138 | |
| 0.016 | 80 | < 0.1% |
| 0.017 | 205 | |
| 0.018 | 186 | |
| 0.019 | 61 | < 0.1% |
| 0.02 | 152 |
credit_score
Real number (ℝ)
| Training Data | Test Data | |
|---|---|---|
| Distinct | 399 | 389 |
| Distinct (%) | 0.1% | 0.2% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 680.91601 | 681.03769 |
| Training Data | Test Data | |
|---|---|---|
| Minimum | 395 | 395 |
| Maximum | 849 | 849 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
Quantile statistics
| Training Data | Test Data | |
|---|---|---|
| Minimum | 395 | 395 |
| 5-th percentile | 582 | 582 |
| Q1 | 646 | 646 |
| median | 682 | 683 |
| Q3 | 719 | 719 |
| 95-th percentile | 767 | 767 |
| Maximum | 849 | 849 |
| Range | 454 | 454 |
| Interquartile range (IQR) | 73 | 73 |
Descriptive statistics
| Training Data | Test Data | |
|---|---|---|
| Standard deviation | 55.424956 | 55.624118 |
| Coefficient of variation (CV) | 0.08139764 | 0.081675535 |
| Kurtosis | 0.09596164 | 0.10512082 |
| Mean | 680.91601 | 681.03769 |
| Median Absolute Deviation (MAD) | 36 | 36 |
| Skewness | -0.16699288 | -0.17167056 |
| Sum | 4.0446002 × 108 | 1.7337108 × 108 |
| Variance | 3071.9257 | 3094.0425 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 678 | 6526 | 1.1% |
| 661 | 5801 | 1.0% |
| 674 | 5793 | 1.0% |
| 708 | 5661 | 1.0% |
| 681 | 5635 | 0.9% |
| 672 | 5622 | 0.9% |
| 669 | 5618 | 0.9% |
| 685 | 5557 | 0.9% |
| 713 | 5544 | 0.9% |
| 676 | 5508 | 0.9% |
| Other values (389) | 536729 |
| Value | Count | Frequency (%) |
| 678 | 2711 | 1.1% |
| 672 | 2531 | 1.0% |
| 661 | 2520 | 1.0% |
| 669 | 2457 | 1.0% |
| 708 | 2443 | 1.0% |
| 681 | 2442 | 1.0% |
| 713 | 2425 | 1.0% |
| 674 | 2354 | 0.9% |
| 676 | 2321 | 0.9% |
| 688 | 2304 | 0.9% |
| Other values (379) | 230061 |
| Value | Count | Frequency (%) |
| 395 | 2 | |
| 431 | 1 | < 0.1% |
| 435 | 2 | |
| 437 | 3 | |
| 439 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 441 | 1 | < 0.1% |
| 445 | 4 | |
| 446 | 2 | |
| 447 | 2 |
| Value | Count | Frequency (%) |
| 395 | 1 | < 0.1% |
| 431 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 442 | 1 | < 0.1% |
| 443 | 1 | < 0.1% |
| 445 | 3 | |
| 447 | 1 | < 0.1% |
| 449 | 1 | < 0.1% |
| 453 | 2 | |
| 459 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 395 | 1 | < 0.1% |
| 431 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 442 | 1 | < 0.1% |
| 443 | 1 | < 0.1% |
| 445 | 3 | |
| 447 | 1 | < 0.1% |
| 449 | 1 | < 0.1% |
| 453 | 2 | |
| 459 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 395 | 2 | |
| 431 | 1 | < 0.1% |
| 435 | 2 | |
| 437 | 3 | |
| 439 | 1 | < 0.1% |
| 440 | 1 | < 0.1% |
| 441 | 1 | < 0.1% |
| 445 | 4 | |
| 446 | 2 | |
| 447 | 2 |
loan_amount
Real number (ℝ)
| Training Data | Test Data | |
|---|---|---|
| Distinct | 111570 | 65199 |
| Distinct (%) | 18.8% | 25.6% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 15020.298 | 15016.753 |
| Training Data | Test Data | |
|---|---|---|
| Minimum | 500.09 | 500.05 |
| Maximum | 48959.95 | 48959.26 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
Quantile statistics
| Training Data | Test Data | |
|---|---|---|
| Minimum | 500.09 | 500.05 |
| 5-th percentile | 3139.37 | 3144.66 |
| Q1 | 10279.62 | 10248.58 |
| median | 15000.22 | 15000.22 |
| Q3 | 18858.58 | 18831.46 |
| 95-th percentile | 27139.83 | 27124.17 |
| Maximum | 48959.95 | 48959.26 |
| Range | 48459.86 | 48459.21 |
| Interquartile range (IQR) | 8578.96 | 8582.88 |
Descriptive statistics
| Training Data | Test Data | |
|---|---|---|
| Standard deviation | 6926.5306 | 6922.1652 |
| Coefficient of variation (CV) | 0.46114469 | 0.46096283 |
| Kurtosis | -0.15014223 | -0.15334262 |
| Mean | 15020.298 | 15016.753 |
| Median Absolute Deviation (MAD) | 4386.47 | 4393.43 |
| Skewness | 0.20735982 | 0.20573601 |
| Sum | 8.9219667 × 109 | 3.8227999 × 109 |
| Variance | 47976826 | 47916371 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 12892.25 | 412 | 0.1% |
| 15212.88 | 338 | 0.1% |
| 16004.97 | 282 | < 0.1% |
| 1838.88 | 278 | < 0.1% |
| 17051.01 | 255 | < 0.1% |
| 15011.15 | 250 | < 0.1% |
| 18078.57 | 241 | < 0.1% |
| 12551.14 | 241 | < 0.1% |
| 18054.98 | 237 | < 0.1% |
| 8146.24 | 232 | < 0.1% |
| Other values (111560) | 591228 |
| Value | Count | Frequency (%) |
| 12892.25 | 190 | 0.1% |
| 16004.97 | 118 | < 0.1% |
| 1838.88 | 113 | < 0.1% |
| 18078.57 | 112 | < 0.1% |
| 15212.88 | 109 | < 0.1% |
| 17051.01 | 107 | < 0.1% |
| 12093.58 | 107 | < 0.1% |
| 15011.15 | 104 | < 0.1% |
| 17054.68 | 102 | < 0.1% |
| 12093.5 | 102 | < 0.1% |
| Other values (65189) | 253405 |
| Value | Count | Frequency (%) |
| 500.09 | 1 | < 0.1% |
| 500.37 | 1 | < 0.1% |
| 500.91 | 1 | < 0.1% |
| 502.91 | 1 | < 0.1% |
| 507.41 | 1 | < 0.1% |
| 507.42 | 1 | < 0.1% |
| 507.46 | 3 | |
| 507.86 | 1 | < 0.1% |
| 508.34 | 1 | < 0.1% |
| 508.35 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500.05 | 1 | < 0.1% |
| 502.91 | 1 | < 0.1% |
| 507.46 | 1 | < 0.1% |
| 508.51 | 1 | < 0.1% |
| 508.73 | 1 | < 0.1% |
| 514 | 1 | < 0.1% |
| 514.07 | 1 | < 0.1% |
| 514.16 | 1 | < 0.1% |
| 514.4 | 2 | < 0.1% |
| 514.5 | 8 |
| Value | Count | Frequency (%) |
| 500.05 | 1 | < 0.1% |
| 502.91 | 1 | < 0.1% |
| 507.46 | 1 | < 0.1% |
| 508.51 | 1 | < 0.1% |
| 508.73 | 1 | < 0.1% |
| 514 | 1 | < 0.1% |
| 514.07 | 1 | < 0.1% |
| 514.16 | 1 | < 0.1% |
| 514.4 | 2 | < 0.1% |
| 514.5 | 8 |
| Value | Count | Frequency (%) |
| 500.09 | 1 | < 0.1% |
| 500.37 | 1 | < 0.1% |
| 500.91 | 1 | < 0.1% |
| 502.91 | 1 | < 0.1% |
| 507.41 | 1 | < 0.1% |
| 507.42 | 1 | < 0.1% |
| 507.46 | 3 | |
| 507.86 | 1 | < 0.1% |
| 508.34 | 1 | < 0.1% |
| 508.35 | 1 | < 0.1% |
interest_rate
Real number (ℝ)
| Training Data | Test Data | |
|---|---|---|
| Distinct | 1454 | 1385 |
| Distinct (%) | 0.2% | 0.5% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 12.356345 | 12.352323 |
| Training Data | Test Data | |
|---|---|---|
| Minimum | 3.2 | 3.2 |
| Maximum | 20.99 | 21.29 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
Quantile statistics
| Training Data | Test Data | |
|---|---|---|
| Minimum | 3.2 | 3.2 |
| 5-th percentile | 9.1 | 9.07 |
| Q1 | 10.99 | 10.98 |
| median | 12.37 | 12.37 |
| Q3 | 13.68 | 13.69 |
| 95-th percentile | 15.72 | 15.72 |
| Maximum | 20.99 | 21.29 |
| Range | 17.79 | 18.09 |
| Interquartile range (IQR) | 2.69 | 2.71 |
Descriptive statistics
| Training Data | Test Data | |
|---|---|---|
| Standard deviation | 2.0089589 | 2.0176018 |
| Coefficient of variation (CV) | 0.1625852 | 0.16333783 |
| Kurtosis | 0.059797501 | 0.055029633 |
| Mean | 12.356345 | 12.352323 |
| Median Absolute Deviation (MAD) | 1.34 | 1.35 |
| Skewness | 0.049945315 | 0.043392647 |
| Sum | 7339594.9 | 3144518.6 |
| Variance | 4.0359159 | 4.0707169 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 12.31 | 2638 | 0.4% |
| 12.52 | 2436 | 0.4% |
| 13.35 | 2415 | 0.4% |
| 12.82 | 2406 | 0.4% |
| 12.23 | 2362 | 0.4% |
| 11.26 | 2318 | 0.4% |
| 11.6 | 2236 | 0.4% |
| 13.78 | 2222 | 0.4% |
| 12.09 | 2215 | 0.4% |
| 12.81 | 2209 | 0.4% |
| Other values (1444) | 570537 |
| Value | Count | Frequency (%) |
| 12.31 | 1112 | 0.4% |
| 12.82 | 1043 | 0.4% |
| 12.52 | 1005 | 0.4% |
| 13.35 | 1005 | 0.4% |
| 11.26 | 992 | 0.4% |
| 12.23 | 985 | 0.4% |
| 11.6 | 974 | 0.4% |
| 13.78 | 951 | 0.4% |
| 12.98 | 948 | 0.4% |
| 12.35 | 931 | 0.4% |
| Other values (1375) | 244623 |
| Value | Count | Frequency (%) |
| 3.2 | 1 | < 0.1% |
| 3.32 | 1 | < 0.1% |
| 3.66 | 1 | < 0.1% |
| 3.79 | 1 | < 0.1% |
| 3.81 | 3 | |
| 3.83 | 1 | < 0.1% |
| 3.89 | 2 | |
| 3.92 | 1 | < 0.1% |
| 3.98 | 2 | |
| 4.01 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.2 | 1 | |
| 3.79 | 1 | |
| 3.81 | 1 | |
| 3.97 | 1 | |
| 4 | 1 | |
| 4.11 | 1 | |
| 4.18 | 2 | |
| 4.28 | 1 | |
| 4.29 | 1 | |
| 4.3 | 1 |
| Value | Count | Frequency (%) |
| 3.2 | 1 | |
| 3.79 | 1 | |
| 3.81 | 1 | |
| 3.97 | 1 | |
| 4 | 1 | |
| 4.11 | 1 | |
| 4.18 | 2 | |
| 4.28 | 1 | |
| 4.29 | 1 | |
| 4.3 | 1 |
| Value | Count | Frequency (%) |
| 3.2 | 1 | < 0.1% |
| 3.32 | 1 | < 0.1% |
| 3.66 | 1 | < 0.1% |
| 3.79 | 1 | < 0.1% |
| 3.81 | 3 | |
| 3.83 | 1 | < 0.1% |
| 3.89 | 2 | |
| 3.92 | 1 | < 0.1% |
| 3.98 | 2 | |
| 4.01 | 1 | < 0.1% |
gender
Categorical
| Training Data | Test Data | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
| Female | |
|---|---|
| Male | |
| Other | 3728 |
| Female | |
|---|---|
| Male | |
| Other | 1642 |
Length
| Training Data | Test Data | |
|---|---|---|
| Max length | 6 | 6 |
| Median length | 6 | 6 |
| Mean length | 5.0371788 | 5.0394117 |
| Min length | 4 | 4 |
Unique
| Training Data | Test Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Test Data | |
|---|---|---|
| 1st row | Female | Female |
| 2nd row | Male | Female |
| 3rd row | Male | Male |
| 4th row | Female | Female |
| 5th row | Male | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 306175 | |
| Male | 284091 | |
| Other | 3728 | 0.6% |
| Value | Count | Frequency (%) |
| Female | 131480 | |
| Male | 121447 | |
| Other | 1642 | 0.6% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Test Data
| Value | Count | Frequency (%) |
| female | 306175 | |
| male | 284091 | |
| other | 3728 | 0.6% |
| Value | Count | Frequency (%) |
| female | 131480 | |
| male | 121447 | |
| other | 1642 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 386049 | |
| a | 252927 | |
| l | 252927 | |
| F | 131480 | 10.2% |
| m | 131480 | 10.2% |
| M | 121447 | 9.5% |
| O | 1642 | 0.1% |
| t | 1642 | 0.1% |
| h | 1642 | 0.1% |
| r | 1642 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2992054 |
| Value | Count | Frequency (%) |
| (unknown) | 1282878 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 386049 | |
| a | 252927 | |
| l | 252927 | |
| F | 131480 | 10.2% |
| m | 131480 | 10.2% |
| M | 121447 | 9.5% |
| O | 1642 | 0.1% |
| t | 1642 | 0.1% |
| h | 1642 | 0.1% |
| r | 1642 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2992054 |
| Value | Count | Frequency (%) |
| (unknown) | 1282878 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 386049 | |
| a | 252927 | |
| l | 252927 | |
| F | 131480 | 10.2% |
| m | 131480 | 10.2% |
| M | 121447 | 9.5% |
| O | 1642 | 0.1% |
| t | 1642 | 0.1% |
| h | 1642 | 0.1% |
| r | 1642 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2992054 |
| Value | Count | Frequency (%) |
| (unknown) | 1282878 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 900169 | |
| a | 590266 | |
| l | 590266 | |
| F | 306175 | 10.2% |
| m | 306175 | 10.2% |
| M | 284091 | 9.5% |
| O | 3728 | 0.1% |
| t | 3728 | 0.1% |
| h | 3728 | 0.1% |
| r | 3728 | 0.1% |
| Value | Count | Frequency (%) |
| e | 386049 | |
| a | 252927 | |
| l | 252927 | |
| F | 131480 | 10.2% |
| m | 131480 | 10.2% |
| M | 121447 | 9.5% |
| O | 1642 | 0.1% |
| t | 1642 | 0.1% |
| h | 1642 | 0.1% |
| r | 1642 | 0.1% |
marital_status
Categorical
| Training Data | Test Data | |
|---|---|---|
| Distinct | 4 | 4 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
| Single | |
|---|---|
| Married | |
| Divorced | 21312 |
| Widowed | 6600 |
| Single | |
|---|---|
| Married | |
| Divorced | 9122 |
| Widowed | 2761 |
Length
| Training Data | Test Data | |
|---|---|---|
| Max length | 8 | 8 |
| Median length | 7 | 7 |
| Mean length | 6.5496066 | 6.5499688 |
| Min length | 6 | 6 |
Unique
| Training Data | Test Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Test Data | |
|---|---|---|
| 1st row | Single | Single |
| 2nd row | Married | Married |
| 3rd row | Single | Single |
| 4th row | Single | Single |
| 5th row | Married | Married |
Common Values
| Value | Count | Frequency (%) |
| Single | 288843 | |
| Married | 277239 | |
| Divorced | 21312 | 3.6% |
| Widowed | 6600 | 1.1% |
| Value | Count | Frequency (%) |
| Single | 123686 | |
| Married | 119000 | |
| Divorced | 9122 | 3.6% |
| Widowed | 2761 | 1.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Test Data
| Value | Count | Frequency (%) |
| single | 288843 | |
| married | 277239 | |
| divorced | 21312 | 3.6% |
| widowed | 6600 | 1.1% |
| Value | Count | Frequency (%) |
| single | 123686 | |
| married | 119000 | |
| divorced | 9122 | 3.6% |
| widowed | 2761 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 254569 | |
| e | 254569 | |
| r | 247122 | |
| d | 133644 | |
| g | 123686 | |
| l | 123686 | |
| n | 123686 | |
| S | 123686 | |
| a | 119000 | |
| M | 119000 | |
| Other values (6) | 44771 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3890427 |
| Value | Count | Frequency (%) |
| (unknown) | 1667419 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 254569 | |
| e | 254569 | |
| r | 247122 | |
| d | 133644 | |
| g | 123686 | |
| l | 123686 | |
| n | 123686 | |
| S | 123686 | |
| a | 119000 | |
| M | 119000 | |
| Other values (6) | 44771 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3890427 |
| Value | Count | Frequency (%) |
| (unknown) | 1667419 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 254569 | |
| e | 254569 | |
| r | 247122 | |
| d | 133644 | |
| g | 123686 | |
| l | 123686 | |
| n | 123686 | |
| S | 123686 | |
| a | 119000 | |
| M | 119000 | |
| Other values (6) | 44771 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3890427 |
| Value | Count | Frequency (%) |
| (unknown) | 1667419 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 593994 | |
| e | 593994 | |
| r | 575790 | |
| d | 311751 | |
| g | 288843 | |
| l | 288843 | |
| n | 288843 | |
| S | 288843 | |
| a | 277239 | |
| M | 277239 | |
| Other values (6) | 105048 | 2.7% |
| Value | Count | Frequency (%) |
| i | 254569 | |
| e | 254569 | |
| r | 247122 | |
| d | 133644 | |
| g | 123686 | |
| l | 123686 | |
| n | 123686 | |
| S | 123686 | |
| a | 119000 | |
| M | 119000 | |
| Other values (6) | 44771 | 2.7% |
education_level
Categorical
| Training Data | Test Data | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
| Bachelor's | |
|---|---|
| High School | |
| Master's | |
| Other | 26677 |
| PhD | 11022 |
| Bachelor's | |
|---|---|
| High School | |
| Master's | |
| Other | 11325 |
| PhD | 4807 |
Length
| Training Data | Test Data | |
|---|---|---|
| Max length | 11 | 11 |
| Median length | 10 | 10 |
| Mean length | 9.6411731 | 9.6415942 |
| Min length | 3 | 3 |
Unique
| Training Data | Test Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Test Data | |
|---|---|---|
| 1st row | High School | High School |
| 2nd row | Master's | Master's |
| 3rd row | High School | Bachelor's |
| 4th row | High School | Bachelor's |
| 5th row | High School | PhD |
Common Values
| Value | Count | Frequency (%) |
| Bachelor's | 279606 | |
| High School | 183592 | |
| Master's | 93097 | 15.7% |
| Other | 26677 | 4.5% |
| PhD | 11022 | 1.9% |
| Value | Count | Frequency (%) |
| Bachelor's | 119924 | |
| High School | 78687 | |
| Master's | 39826 | 15.6% |
| Other | 11325 | 4.4% |
| PhD | 4807 | 1.9% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Test Data
| Value | Count | Frequency (%) |
| bachelor's | 279606 | |
| high | 183592 | |
| school | 183592 | |
| master's | 93097 | 12.0% |
| other | 26677 | 3.4% |
| phd | 11022 | 1.4% |
| Value | Count | Frequency (%) |
| bachelor's | 119924 | |
| high | 78687 | |
| school | 78687 | |
| master's | 39826 | 12.0% |
| other | 11325 | 3.4% |
| phd | 4807 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 293430 | |
| o | 277298 | |
| s | 199576 | 8.1% |
| c | 198611 | 8.1% |
| l | 198611 | 8.1% |
| e | 171075 | 7.0% |
| r | 171075 | 7.0% |
| a | 159750 | 6.5% |
| ' | 159750 | 6.5% |
| B | 119924 | 4.9% |
| Other values (10) | 505351 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5726799 |
| Value | Count | Frequency (%) |
| (unknown) | 2454451 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 293430 | |
| o | 277298 | |
| s | 199576 | 8.1% |
| c | 198611 | 8.1% |
| l | 198611 | 8.1% |
| e | 171075 | 7.0% |
| r | 171075 | 7.0% |
| a | 159750 | 6.5% |
| ' | 159750 | 6.5% |
| B | 119924 | 4.9% |
| Other values (10) | 505351 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5726799 |
| Value | Count | Frequency (%) |
| (unknown) | 2454451 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 293430 | |
| o | 277298 | |
| s | 199576 | 8.1% |
| c | 198611 | 8.1% |
| l | 198611 | 8.1% |
| e | 171075 | 7.0% |
| r | 171075 | 7.0% |
| a | 159750 | 6.5% |
| ' | 159750 | 6.5% |
| B | 119924 | 4.9% |
| Other values (10) | 505351 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5726799 |
| Value | Count | Frequency (%) |
| (unknown) | 2454451 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 684489 | |
| o | 646790 | |
| s | 465800 | 8.1% |
| c | 463198 | 8.1% |
| l | 463198 | 8.1% |
| e | 399380 | 7.0% |
| r | 399380 | 7.0% |
| a | 372703 | 6.5% |
| ' | 372703 | 6.5% |
| B | 279606 | 4.9% |
| Other values (10) | 1179552 |
| Value | Count | Frequency (%) |
| h | 293430 | |
| o | 277298 | |
| s | 199576 | 8.1% |
| c | 198611 | 8.1% |
| l | 198611 | 8.1% |
| e | 171075 | 7.0% |
| r | 171075 | 7.0% |
| a | 159750 | 6.5% |
| ' | 159750 | 6.5% |
| B | 119924 | 4.9% |
| Other values (10) | 505351 |
employment_status
Categorical
| Training Data | Test Data | |
|---|---|---|
| Distinct | 5 | 5 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
| Employed | |
|---|---|
| Unemployed | |
| Self-employed | |
| Retired | 16453 |
| Student | 11931 |
| Employed | |
|---|---|
| Unemployed | |
| Self-employed | |
| Retired | 7060 |
| Student | 5044 |
Length
| Training Data | Test Data | |
|---|---|---|
| Max length | 13 | 13 |
| Median length | 8 | 8 |
| Mean length | 8.6043596 | 8.6051051 |
| Min length | 7 | 7 |
Unique
| Training Data | Test Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Test Data | |
|---|---|---|
| 1st row | Self-employed | Employed |
| 2nd row | Employed | Employed |
| 3rd row | Employed | Employed |
| 4th row | Employed | Employed |
| 5th row | Employed | Employed |
Common Values
| Value | Count | Frequency (%) |
| Employed | 450645 | |
| Unemployed | 62485 | 10.5% |
| Self-employed | 52480 | 8.8% |
| Retired | 16453 | 2.8% |
| Student | 11931 | 2.0% |
| Value | Count | Frequency (%) |
| Employed | 193207 | |
| Unemployed | 26715 | 10.5% |
| Self-employed | 22543 | 8.9% |
| Retired | 7060 | 2.8% |
| Student | 5044 | 2.0% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Test Data
| Value | Count | Frequency (%) |
| employed | 450645 | |
| unemployed | 62485 | 10.5% |
| self-employed | 52480 | 8.8% |
| retired | 16453 | 2.8% |
| student | 11931 | 2.0% |
| Value | Count | Frequency (%) |
| employed | 193207 | |
| unemployed | 26715 | 10.5% |
| self-employed | 22543 | 8.9% |
| retired | 7060 | 2.8% |
| student | 5044 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 333430 | |
| l | 265008 | |
| d | 254569 | |
| m | 242465 | |
| y | 242465 | |
| p | 242465 | |
| o | 242465 | |
| E | 193207 | |
| n | 31759 | 1.4% |
| S | 27587 | 1.3% |
| Other values (8) | 115173 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5110938 |
| Value | Count | Frequency (%) |
| (unknown) | 2190593 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 333430 | |
| l | 265008 | |
| d | 254569 | |
| m | 242465 | |
| y | 242465 | |
| p | 242465 | |
| o | 242465 | |
| E | 193207 | |
| n | 31759 | 1.4% |
| S | 27587 | 1.3% |
| Other values (8) | 115173 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5110938 |
| Value | Count | Frequency (%) |
| (unknown) | 2190593 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 333430 | |
| l | 265008 | |
| d | 254569 | |
| m | 242465 | |
| y | 242465 | |
| p | 242465 | |
| o | 242465 | |
| E | 193207 | |
| n | 31759 | 1.4% |
| S | 27587 | 1.3% |
| Other values (8) | 115173 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5110938 |
| Value | Count | Frequency (%) |
| (unknown) | 2190593 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 777892 | |
| l | 618090 | |
| d | 593994 | |
| m | 565610 | |
| y | 565610 | |
| p | 565610 | |
| o | 565610 | |
| E | 450645 | |
| n | 74416 | 1.5% |
| S | 64411 | 1.3% |
| Other values (8) | 269050 | 5.3% |
| Value | Count | Frequency (%) |
| e | 333430 | |
| l | 265008 | |
| d | 254569 | |
| m | 242465 | |
| y | 242465 | |
| p | 242465 | |
| o | 242465 | |
| E | 193207 | |
| n | 31759 | 1.4% |
| S | 27587 | 1.3% |
| Other values (8) | 115173 | 5.3% |
loan_purpose
Categorical
| Training Data | Test Data | |
|---|---|---|
| Distinct | 8 | 8 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
| Debt consolidation | |
|---|---|
| Other | |
| Car | |
| Home | |
| Education | |
| Other values (3) |
| Debt consolidation | |
|---|---|
| Other | |
| Car | |
| Home | |
| Education | |
| Other values (3) |
Length
| Training Data | Test Data | |
|---|---|---|
| Max length | 18 | 18 |
| Median length | 18 | 18 |
| Mean length | 12.38077 | 12.368973 |
| Min length | 3 | 3 |
Unique
| Training Data | Test Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Test Data | |
|---|---|---|
| 1st row | Other | Other |
| 2nd row | Debt consolidation | Other |
| 3rd row | Debt consolidation | Debt consolidation |
| 4th row | Debt consolidation | Debt consolidation |
| 5th row | Other | Business |
Common Values
| Value | Count | Frequency (%) |
| Debt consolidation | 324695 | |
| Other | 63874 | 10.8% |
| Car | 58108 | 9.8% |
| Home | 44118 | 7.4% |
| Education | 36641 | 6.2% |
| Business | 35303 | 5.9% |
| Medical | 22806 | 3.8% |
| Vacation | 8449 | 1.4% |
| Value | Count | Frequency (%) |
| Debt consolidation | 138963 | |
| Other | 27715 | 10.9% |
| Car | 24889 | 9.8% |
| Home | 18984 | 7.5% |
| Education | 15719 | 6.2% |
| Business | 15076 | 5.9% |
| Medical | 9618 | 3.8% |
| Vacation | 3605 | 1.4% |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Test Data
| Value | Count | Frequency (%) |
| debt | 324695 | |
| consolidation | 324695 | |
| other | 63874 | 7.0% |
| car | 58108 | 6.3% |
| home | 44118 | 4.8% |
| education | 36641 | 4.0% |
| business | 35303 | 3.8% |
| medical | 22806 | 2.5% |
| vacation | 8449 | 0.9% |
| Value | Count | Frequency (%) |
| debt | 138963 | |
| consolidation | 138963 | |
| other | 27715 | 7.0% |
| car | 24889 | 6.3% |
| home | 18984 | 4.8% |
| education | 15719 | 4.0% |
| business | 15076 | 3.8% |
| medical | 9618 | 2.4% |
| vacation | 3605 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 455197 | |
| t | 324965 | |
| i | 321944 | |
| n | 312326 | |
| e | 210356 | 6.7% |
| a | 196399 | 6.2% |
| s | 184191 | 5.8% |
| c | 167905 | 5.3% |
| d | 164300 | 5.2% |
| l | 148581 | 4.7% |
| Other values (14) | 662593 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7354103 |
| Value | Count | Frequency (%) |
| (unknown) | 3148757 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 455197 | |
| t | 324965 | |
| i | 321944 | |
| n | 312326 | |
| e | 210356 | 6.7% |
| a | 196399 | 6.2% |
| s | 184191 | 5.8% |
| c | 167905 | 5.3% |
| d | 164300 | 5.2% |
| l | 148581 | 4.7% |
| Other values (14) | 662593 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7354103 |
| Value | Count | Frequency (%) |
| (unknown) | 3148757 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 455197 | |
| t | 324965 | |
| i | 321944 | |
| n | 312326 | |
| e | 210356 | 6.7% |
| a | 196399 | 6.2% |
| s | 184191 | 5.8% |
| c | 167905 | 5.3% |
| d | 164300 | 5.2% |
| l | 148581 | 4.7% |
| Other values (14) | 662593 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7354103 |
| Value | Count | Frequency (%) |
| (unknown) | 3148757 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1063293 | |
| t | 758354 | |
| i | 752589 | |
| n | 729783 | |
| e | 490796 | 6.7% |
| a | 459148 | 6.2% |
| s | 430604 | 5.9% |
| c | 392591 | 5.3% |
| d | 384142 | 5.2% |
| l | 347501 | 4.7% |
| Other values (14) | 1545302 |
| Value | Count | Frequency (%) |
| o | 455197 | |
| t | 324965 | |
| i | 321944 | |
| n | 312326 | |
| e | 210356 | 6.7% |
| a | 196399 | 6.2% |
| s | 184191 | 5.8% |
| c | 167905 | 5.3% |
| d | 164300 | 5.2% |
| l | 148581 | 4.7% |
| Other values (14) | 662593 |
grade_subgrade
Categorical
| Training Data | Test Data | |
|---|---|---|
| Distinct | 30 | 30 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Memory size | 4.5 MiB | 1.9 MiB |
| C3 | |
|---|---|
| C4 | |
| C2 | |
| C1 | |
| C5 | |
| Other values (25) |
| C3 | |
|---|---|
| C4 | |
| C2 | |
| C1 | |
| C5 | |
| Other values (25) |
Length
| Training Data | Test Data | |
|---|---|---|
| Max length | 2 | 2 |
| Median length | 2 | 2 |
| Mean length | 2 | 2 |
| Min length | 2 | 2 |
Unique
| Training Data | Test Data | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Training Data | Test Data | |
|---|---|---|
| 1st row | C3 | D5 |
| 2nd row | D3 | C1 |
| 3rd row | C5 | D1 |
| 4th row | F1 | C3 |
| 5th row | D1 | C1 |
Common Values
| Value | Count | Frequency (%) |
| C3 | 58695 | |
| C4 | 55957 | 9.4% |
| C2 | 54443 | 9.2% |
| C1 | 53363 | 9.0% |
| C5 | 53317 | 9.0% |
| D1 | 37029 | 6.2% |
| D3 | 36694 | 6.2% |
| D4 | 35097 | 5.9% |
| D2 | 34432 | 5.8% |
| D5 | 32101 | 5.4% |
| Other values (20) | 142866 |
| Value | Count | Frequency (%) |
| C3 | 25410 | |
| C4 | 23712 | 9.3% |
| C2 | 23334 | 9.2% |
| C1 | 22814 | 9.0% |
| C5 | 22777 | 8.9% |
| D1 | 15721 | 6.2% |
| D3 | 15639 | 6.1% |
| D4 | 14990 | 5.9% |
| D2 | 14773 | 5.8% |
| D5 | 13923 | 5.5% |
| Other values (20) | 61476 |
Length
Histogram of lengths of the category
Common Values (Plot)
Training Data
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)Test Data
Number of variable categories passes threshold (
config.plot.cat_freq.max_unique)| Value | Count | Frequency (%) |
| c3 | 58695 | |
| c4 | 55957 | 9.4% |
| c2 | 54443 | 9.2% |
| c1 | 53363 | 9.0% |
| c5 | 53317 | 9.0% |
| d1 | 37029 | 6.2% |
| d3 | 36694 | 6.2% |
| d4 | 35097 | 5.9% |
| d2 | 34432 | 5.8% |
| d5 | 32101 | 5.4% |
| Other values (20) | 142866 |
| Value | Count | Frequency (%) |
| c3 | 25410 | |
| c4 | 23712 | 9.3% |
| c2 | 23334 | 9.2% |
| c1 | 22814 | 9.0% |
| c5 | 22777 | 8.9% |
| d1 | 15721 | 6.2% |
| d3 | 15639 | 6.1% |
| d4 | 14990 | 5.9% |
| d2 | 14773 | 5.8% |
| d5 | 13923 | 5.5% |
| Other values (20) | 61476 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 118047 | |
| D | 75046 | |
| 3 | 53262 | |
| 4 | 51283 | |
| 2 | 50597 | |
| 1 | 50535 | |
| 5 | 48892 | |
| B | 30748 | 6.0% |
| E | 14703 | 2.9% |
| F | 11751 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1187988 |
| Value | Count | Frequency (%) |
| (unknown) | 509138 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 118047 | |
| D | 75046 | |
| 3 | 53262 | |
| 4 | 51283 | |
| 2 | 50597 | |
| 1 | 50535 | |
| 5 | 48892 | |
| B | 30748 | 6.0% |
| E | 14703 | 2.9% |
| F | 11751 | 2.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1187988 |
| Value | Count | Frequency (%) |
| (unknown) | 509138 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 118047 | |
| D | 75046 | |
| 3 | 53262 | |
| 4 | 51283 | |
| 2 | 50597 | |
| 1 | 50535 | |
| 5 | 48892 | |
| B | 30748 | 6.0% |
| E | 14703 | 2.9% |
| F | 11751 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1187988 |
| Value | Count | Frequency (%) |
| (unknown) | 509138 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 275775 | |
| D | 175353 | |
| 3 | 123538 | |
| 4 | 120203 | |
| 1 | 118761 | |
| 2 | 117635 | |
| 5 | 113857 | |
| B | 71251 | 6.0% |
| E | 34458 | 2.9% |
| F | 27301 | 2.3% |
| Value | Count | Frequency (%) |
| C | 118047 | |
| D | 75046 | |
| 3 | 53262 | |
| 4 | 51283 | |
| 2 | 50597 | |
| 1 | 50535 | |
| 5 | 48892 | |
| B | 30748 | 6.0% |
| E | 14703 | 2.9% |
| F | 11751 | 2.3% |
loan_paid_back
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 474494 | |
| 0.0 | 119500 | 20.1% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 1.0 | 474494 | |
| 0.0 | 119500 | 20.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1781982 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1781982 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1781982 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 713494 | |
| . | 593994 | |
| 1 | 474494 |
Interactions
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Training Data
Test Data
Correlations
Training Data
Test Data
Training Data
| annual_income | credit_score | debt_to_income_ratio | education_level | employment_status | gender | grade_subgrade | interest_rate | loan_amount | loan_paid_back | loan_purpose | marital_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| annual_income | 1.000 | 0.004 | 0.005 | 0.008 | 0.009 | 0.004 | 0.007 | -0.003 | -0.009 | 0.020 | 0.007 | 0.010 |
| credit_score | 0.004 | 1.000 | -0.060 | 0.007 | 0.051 | 0.008 | 0.638 | -0.517 | -0.008 | 0.232 | 0.008 | 0.011 |
| debt_to_income_ratio | 0.005 | -0.060 | 1.000 | 0.006 | 0.088 | 0.004 | 0.024 | 0.026 | -0.012 | 0.334 | 0.006 | 0.004 |
| education_level | 0.008 | 0.007 | 0.006 | 1.000 | 0.012 | 0.004 | 0.013 | 0.008 | 0.005 | 0.025 | 0.011 | 0.008 |
| employment_status | 0.009 | 0.051 | 0.088 | 0.012 | 1.000 | 0.003 | 0.052 | 0.025 | 0.010 | 0.657 | 0.015 | 0.006 |
| gender | 0.004 | 0.008 | 0.004 | 0.004 | 0.003 | 1.000 | 0.009 | 0.004 | 0.010 | 0.007 | 0.007 | 0.002 |
| grade_subgrade | 0.007 | 0.638 | 0.024 | 0.013 | 0.052 | 0.009 | 1.000 | 0.192 | 0.013 | 0.228 | 0.008 | 0.013 |
| interest_rate | -0.003 | -0.517 | 0.026 | 0.008 | 0.025 | 0.004 | 0.192 | 1.000 | -0.001 | 0.129 | 0.006 | 0.006 |
| loan_amount | -0.009 | -0.008 | -0.012 | 0.005 | 0.010 | 0.010 | 0.013 | -0.001 | 1.000 | 0.013 | 0.008 | 0.008 |
| loan_paid_back | 0.020 | 0.232 | 0.334 | 0.025 | 0.657 | 0.007 | 0.228 | 0.129 | 0.013 | 1.000 | 0.025 | 0.001 |
| loan_purpose | 0.007 | 0.008 | 0.006 | 0.011 | 0.015 | 0.007 | 0.008 | 0.006 | 0.008 | 0.025 | 1.000 | 0.010 |
| marital_status | 0.010 | 0.011 | 0.004 | 0.008 | 0.006 | 0.002 | 0.013 | 0.006 | 0.008 | 0.001 | 0.010 | 1.000 |
Test Data
| annual_income | credit_score | debt_to_income_ratio | education_level | employment_status | gender | grade_subgrade | interest_rate | loan_amount | loan_purpose | marital_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| annual_income | 1.000 | 0.005 | -0.000 | 0.010 | 0.010 | 0.000 | 0.008 | -0.006 | -0.005 | 0.007 | 0.010 |
| credit_score | 0.005 | 1.000 | -0.061 | 0.008 | 0.053 | 0.007 | 0.639 | -0.517 | -0.010 | 0.008 | 0.012 |
| debt_to_income_ratio | -0.000 | -0.061 | 1.000 | 0.008 | 0.087 | 0.007 | 0.024 | 0.027 | -0.013 | 0.006 | 0.005 |
| education_level | 0.010 | 0.008 | 0.008 | 1.000 | 0.012 | 0.001 | 0.011 | 0.006 | 0.008 | 0.011 | 0.007 |
| employment_status | 0.010 | 0.053 | 0.087 | 0.012 | 1.000 | 0.005 | 0.053 | 0.025 | 0.008 | 0.014 | 0.004 |
| gender | 0.000 | 0.007 | 0.007 | 0.001 | 0.005 | 1.000 | 0.008 | 0.004 | 0.008 | 0.003 | 0.002 |
| grade_subgrade | 0.008 | 0.639 | 0.024 | 0.011 | 0.053 | 0.008 | 1.000 | 0.193 | 0.013 | 0.007 | 0.015 |
| interest_rate | -0.006 | -0.517 | 0.027 | 0.006 | 0.025 | 0.004 | 0.193 | 1.000 | -0.000 | 0.005 | 0.006 |
| loan_amount | -0.005 | -0.010 | -0.013 | 0.008 | 0.008 | 0.008 | 0.013 | -0.000 | 1.000 | 0.008 | 0.012 |
| loan_purpose | 0.007 | 0.008 | 0.006 | 0.011 | 0.014 | 0.003 | 0.007 | 0.005 | 0.008 | 1.000 | 0.009 |
| marital_status | 0.010 | 0.012 | 0.005 | 0.007 | 0.004 | 0.002 | 0.015 | 0.006 | 0.012 | 0.009 | 1.000 |
Missing values
Training Data
A simple visualization of nullity by column.
Test Data
A simple visualization of nullity by column.
Training Data
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Test Data
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
Training Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 29,367.990 | 0.084 | 736 | 2,528.420 | 13.670 | Female | Single | High School | Self-employed | Other | C3 | 1.000 |
| 1 | 22,108.020 | 0.166 | 636 | 4,593.100 | 12.920 | Male | Married | Master's | Employed | Debt consolidation | D3 | 0.000 |
| 2 | 49,566.200 | 0.097 | 694 | 17,005.150 | 9.760 | Male | Single | High School | Employed | Debt consolidation | C5 | 1.000 |
| 3 | 46,858.250 | 0.065 | 533 | 4,682.480 | 16.100 | Female | Single | High School | Employed | Debt consolidation | F1 | 1.000 |
| 4 | 25,496.700 | 0.053 | 665 | 12,184.430 | 10.210 | Male | Married | High School | Employed | Other | D1 | 1.000 |
| 5 | 44,940.300 | 0.058 | 653 | 12,159.920 | 12.240 | Male | Single | Bachelor's | Employed | Other | D1 | 1.000 |
| 6 | 61,574.160 | 0.042 | 696 | 16,907.710 | 13.520 | Other | Single | High School | Self-employed | Debt consolidation | C5 | 1.000 |
| 7 | 45,953.310 | 0.100 | 654 | 10,111.620 | 12.820 | Female | Married | High School | Employed | Home | D1 | 1.000 |
| 8 | 30,592.290 | 0.132 | 713 | 7,522.360 | 9.480 | Male | Married | Bachelor's | Employed | Education | C5 | 1.000 |
| 9 | 17,342.450 | 0.121 | 548 | 9,653.480 | 16.040 | Female | Married | Bachelor's | Self-employed | Vacation | F1 | 1.000 |
Test Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 28,781.050 | 0.049 | 626 | 11,461.420 | 14.730 | Female | Single | High School | Employed | Other | D5 |
| 1 | 46,626.390 | 0.093 | 732 | 15,492.250 | 12.850 | Female | Married | Master's | Employed | Other | C1 |
| 2 | 54,954.890 | 0.367 | 611 | 3,796.410 | 13.290 | Male | Single | Bachelor's | Employed | Debt consolidation | D1 |
| 3 | 25,644.630 | 0.110 | 671 | 6,574.300 | 9.570 | Female | Single | Bachelor's | Employed | Debt consolidation | C3 |
| 4 | 25,169.640 | 0.081 | 688 | 17,696.890 | 12.800 | Female | Married | PhD | Employed | Business | C1 |
| 5 | 45,302.900 | 0.060 | 675 | 8,106.780 | 13.740 | Female | Married | High School | Employed | Vacation | C3 |
| 6 | 27,676.470 | 0.061 | 714 | 8,242.260 | 13.870 | Female | Single | High School | Employed | Debt consolidation | C4 |
| 7 | 38,216.910 | 0.095 | 719 | 3,765.500 | 15.100 | Male | Single | High School | Employed | Other | C5 |
| 8 | 25,650.590 | 0.101 | 664 | 20,310.640 | 11.740 | Male | Single | High School | Employed | Education | D4 |
| 9 | 62,497.030 | 0.207 | 651 | 5,177.580 | 13.900 | Female | Divorced | High School | Unemployed | Car | D2 |
Training Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 593984 | 36,169.340 | 0.091 | 676 | 9,986.830 | 14.180 | Female | Married | Bachelor's | Retired | Debt consolidation | C3 | 1.000 |
| 593985 | 37,188.430 | 0.170 | 718 | 17,056.520 | 10.470 | Female | Married | Bachelor's | Employed | Home | C3 | 1.000 |
| 593986 | 25,015.350 | 0.074 | 633 | 15,922.610 | 13.910 | Male | Married | Bachelor's | Employed | Debt consolidation | D2 | 0.000 |
| 593987 | 17,662.680 | 0.074 | 679 | 19,792.920 | 15.480 | Female | Single | Other | Employed | Debt consolidation | C3 | 1.000 |
| 593988 | 15,602.220 | 0.056 | 622 | 25,706.470 | 15.750 | Female | Married | High School | Employed | Debt consolidation | D2 | 1.000 |
| 593989 | 23,004.260 | 0.152 | 703 | 20,958.370 | 10.920 | Female | Single | High School | Employed | Business | C3 | 1.000 |
| 593990 | 35,289.430 | 0.105 | 559 | 3,257.240 | 14.620 | Male | Single | Bachelor's | Employed | Debt consolidation | F5 | 1.000 |
| 593991 | 47,112.640 | 0.072 | 675 | 929.270 | 14.130 | Female | Married | Bachelor's | Employed | Debt consolidation | C1 | 1.000 |
| 593992 | 76,748.440 | 0.067 | 740 | 16,290.400 | 9.870 | Male | Single | Bachelor's | Employed | Debt consolidation | B2 | 1.000 |
| 593993 | 48,959.520 | 0.096 | 752 | 7,707.730 | 10.310 | Male | Married | High School | Employed | Education | B3 | 1.000 |
Test Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 254559 | 30,351.940 | 0.061 | 683 | 17,247.500 | 12.310 | Male | Married | High School | Employed | Debt consolidation | C3 |
| 254560 | 29,792.120 | 0.111 | 648 | 23,495.020 | 14.630 | Male | Married | High School | Employed | Debt consolidation | D1 |
| 254561 | 24,451.560 | 0.086 | 752 | 28,920.260 | 8.540 | Female | Married | Bachelor's | Employed | Business | B3 |
| 254562 | 35,388.910 | 0.169 | 661 | 27,300.010 | 11.930 | Female | Married | High School | Employed | Debt consolidation | D4 |
| 254563 | 17,349.850 | 0.097 | 704 | 12,596.910 | 11.960 | Female | Single | Bachelor's | Employed | Debt consolidation | C4 |
| 254564 | 92,835.970 | 0.068 | 744 | 29,704.000 | 13.480 | Female | Single | Bachelor's | Employed | Debt consolidation | B2 |
| 254565 | 48,846.470 | 0.091 | 634 | 20,284.330 | 9.580 | Female | Married | High School | Employed | Debt consolidation | D4 |
| 254566 | 20,668.520 | 0.096 | 718 | 26,387.550 | 9.000 | Male | Single | Master's | Employed | Debt consolidation | C4 |
| 254567 | 34,105.090 | 0.094 | 739 | 11,107.360 | 9.810 | Male | Single | Bachelor's | Employed | Business | C2 |
| 254568 | 45,627.530 | 0.118 | 624 | 19,246.140 | 11.640 | Female | Married | High School | Employed | Car | D3 |
Duplicate rows
Training Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | loan_paid_back | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | |||||||||||||
Test Data
| annual_income | debt_to_income_ratio | credit_score | loan_amount | interest_rate | gender | marital_status | education_level | employment_status | loan_purpose | grade_subgrade | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset does not contain duplicate rows. | ||||||||||||